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ABSTRACT 

A bilingual experiment is being conducted at the Lin- 
guistics Documentation Centre, University of Ottawa, 
into the elaboration of well structured formulary 
routines for making the writing of abstracts easier, 
at the same time standardizing and generally augment- 
ing the information given in them. (A 1 1 Informathfcque 
de Linguistique de l*Universit£ d'Ottawa, une expe- 
rience bilingue tente presentement , 5 travers un for- 
mulaire rigoureusement structure, d'&tablir des proce- 
d§s routlnlers qui faclllteraient la redaction des 
resumes signaletlques, tout en favorisant la standardi- 
sation et le nombre d ' informations. ) 
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CAVEAT 

This is a preliminary report of research that has begun only 
recently (Nov. 1972). It is being conducted on a very small scale. 
Even so, not all of the work can be described within the space limit of 
this paper. 

INTRODUCTION & PR INCIPLES 

Harris (1971) has shown how traditional bibliographic descriptions 
—the information and layout usually employed in bibliographies or on 
author-title catalogue cards — are so regular that most of their syntax 
can be described by a formal context-free grammar* Some advantages of 
t h is r egularity a re: 

(i) Interchangeability of catalogue cards and easy reading of 
other people's catalogues and bibliographies. 
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(it) Clearly implied directions to the cataloguer/bibliographer 
^ as to what information to extract from documents; which 

' vastly helps the maintenance of standards in the matter. 

(ill) Fields that are well delimited for information retrieval 
systems. 

i In sharp contrast to this epitome of regularity stand content 
descriptions that are written in the form of running text: the typical 
example is abstracts. (By this definition we exclude from consideration 
Uniterm, multiterm and facetted classification languages,) From the 
linguistic point of view abstracting remains freely creative; but conse- 
quently it lacks the advantages just mentioned. 

Advantages (i) and (ill) above accrue to the user. So does <ii) 
eventually, but it first affects the documental 1st * Those implied 
directions help the latter by telling him: 

(a) What he must not fail to account for, even if it might not 
all appear of interest to him personally. 

(b) The order in w.tich to set it down, relieving him of 
many decisions about the expressions and punctuation to 
use . 

Conversely, misunderstandings ommisslons and garbling are reduced. 

If the constraints of a well tried format make work easier, the 
corollary is that the operation should be faster and cheaper. Even if 
the format is not rigidly adhered to in practice, a training in it 
should habituate an Intellect to seeking out requisite information and 
should leave suitable terminology ready to spring to mind to express it. 
The help that a tyro abstracter receives at present is usually vague 
general advice such as, "Pick out what is new" or "What is the thesis of 
the author?" 

Working with nrid within a regulated framework does not render the 
task entirely mechanical. Nothing of value can be had without allowing 
the bibliographer to make judgements, and the primary judgement to be 
made is that of selecting salient information. Of course much research 
is going on into automating the selection process; however, the potential 
connection with automatic abstracting lies beyond the scope of the 
present experiment* 

There is 4 spectrum of abstracts that ranges from the very brief 
•indicative 1 abstract --of which the abstract at the head of this paper 
is an example—* through the 'informative abstract- which "presents the 
conceptual content 11 of the document , to the critical review which may 
sometimes rise to the status of an important work in its own right* 
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Amongst the criteria laid down for the formulary to be used In this 
experiment were the following: 

(I) It oust be as general as possible In the sense that it can 
be applied to as many documents and areas as possible. 

(II) It must avoid areas of Judgement that are open to wide 
disagreement. Value judgements are to be excluded, 
although they may be put into... 

(ill) Additional comments which are allowed as a supplement to 
the formulated information in case the specifics of the 
formulary neglect some important aspect. Indeed these 
additional comments should provide the basis for expansion 
of the formulary itself. 

(iv) For the same reason, the analyst should be allowed to use 
a term of his own within the formulary when he feels that 
none of those provided fits the document. 

These criteria and a number of other considerations are dealt 
with at greater length in Hofmann (1972). 

METHODOLOGY A , ■ 

As a first step, an initial formulary was drawn up in English. 
This primitive version is reproduced as Fig* 1. When it is compared 
with guidelines issued by a well-established abstracting agency (Fig. 2), 
it is seen to cover much the same ground, and seems to prescribe a 
fullness somewhere between that of the informative and the indicative 
abstract. The difference lies foremost in the help given to the 
abstracter, because at the same time as he is directed to seek information 
he is provided with ready-made terminology and syntax. We will turn 
later to consideration of the product. 

By application of the guidelines quoted above, the formulary has 
been considerably expanded, both in Its terminology and In its constituent 
structure. A recent English version Is Fig. 3. The expansions come about 
when one or other of the two analysts working on the documents (L. LSgarS 
and Mi Gelbert) exceeds the formulary. They then discuss with a 
coordinator whether their addition is really necessary or whether they 
have failed to make full use of what the formulary already provides. If 
their addition is accepted as necessary, it is incorporated in an updated 
version. After five months of this trial and error, the formulary seems 
to have a chance of reaching satisfactory stability; that we would 
define as no further changes in the constituent structure and not more 
than one addition to the vocabulary per 1,000 documents analyzed. 
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LegarS is working on French documents, and while so doing is 
compiling a French translation of the formulary which will be published 
later ♦ 

As a rough and ready way of early evaluation, we have had the 
analysts do abstracts of documents that had already been abstracted in 
reputable Journals, namely "Language and Automation 11 and "Language and 
Language Behavior Abstracts"* Of course our analysts did not see the 
other abstracts before doing theirs. We then compared the items of 
information given in the paired abstracts* An example of this sort of 
comparison is Fig* 4. 

Approximately 50 documents have so far been analyzed. 

EXPANSION OF THE FORMULARY 

The increased flexibility , subtlety and precision provided by 
the larger vocabulary in Fig. 3 is obvious. However, it also contains 
some deadwood in the shape of expressions that were in the original 
formulary (Fig* 1) but have not proved their worth. Sooner or later, 
when we have enough data to do it safely, we shall have to prune. 

At this point an explanation of the notation used in Figs* 1 and 
3 is called for. 

groups a set of terms out of which the abstractor may 
(in some cases must) chose one term each time be works 
through the formulary. 

(....) surrounds sets that are optional. The abstractor should 
always consider whether they are applicable, but often 
they are not. Conversely, sets that are not in paren- 
theses are obligatory. 

[••••J surrounds an explanation which is intended to guide the 
abstractor but is not for use in his text. 

CAPITALS distinguish the literals, i.e* the terms themselves 
that are to be used. 

Underlined lower-case letters are used for variables. 

Variables s x, jf , £ for noun phrases, £ for sentences, If or lan- 
guage names or types; / separates alternative variables* 

The expansion ~- a significantly increased complexity — in the 
constituent structure reflects the complexity of the documents themselves 
and was demanded if one wanted to retain more than the very briefest of 
indicative-type information. 



21 



FABS 

The sophistications are of three kinds: 

(i) The split of the original sections II and III into II. Ill 
and IV so as to introduce certain information that is now 
provided for by III and take some of the functional load 
off II and IV. 

(il) The addition of several optional constituents: see the 
parentheses notation above. This gives more flexibility. 

(ill) Footnoting, to accommodate the additional comments pres- 
cribed by the criteria* Footnotes can also be used for 
bibliographical references. 

(iv) Most important perhaps, the introduction of the iterative 
mechanism and its notation, (see below). 

ITERATION 

A single straight pass through the formulary would still only 
permit a brief indicative abstract. Indeed one way to force brevity Is 
to insist that it be used that way (cf. Fig. 5, no. 3). 

For analysis in greater depth, however, it soon became clear that 
a way was needed to put in more Information, and this without having to 
make the distinction between essential and marginal information that 
footnotes imply. On the other hand, the general aim of the project 
required that the syntax be kept formally simple and, as linguists would 
say, transparent 1 (i.e. marked overtly). The solution was to allow any 
amount of 'backtracking 1 ; at the extreme one can do a complete 'da capo 1 
from section IV to section I. So that Instead of being restricted to 
the order 

(i) I -* II -t III-* IV 

one can go 



(ii) I r* II-* III 



or 



II -> III-* IV 



(ill) I -* II- II W IV ^ 

Sir' 



II 

B III IV 



and so on. 
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Each backtrack is marked by the special conjunction 'A 1 , or by a 
string of 4s in which there is an additional A for each section backwards 
that one moves. Translated into this notation, the above three examples 
are rendered: 



(i) 


I 


II 


III 


IV 






(ii) 


I 


II 


III 


4411 


III 


IV 


(iii) 


I 


II 


III 


IV 


44441 


II 411 411 III IV 



The form f and' Is reserved for normal lntra-sententlal use within a sec- 
tion. 

Though this may sound formidable, we feel the product can be read 
without strain: see the examples in Fig. 5, where some help is given to 
the eye by paragraphing. 

INTERIM CONCLUSION 

So far as we are aware, no other research is being done in this 
direction. Our immediate aim is to arrive at a satisfactory formulary; 
that is to say one whir i is comfortable and helpful for the abstractor, 
readable and useful for researchers, easily parsed by computer. We are 
feeling our way within these constraints. Reduction in the cost and 
Increase In the speed of abstracting are very important problem areas, 
but cannot be tackled until an ulterior stage of research and would 
require sore substantial funding. 
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FIRST VERSION OF FABS FORMULARY 



Section: I. Performative Verb 



II. Aspect, Typ e 



[The author} 



PRESENTS \ 
ASSERTS 
DENIES 
PROPOSES 
REPORTS 
INTERPRETS 
SUMMARIZES 
^SPECULATES 



A 

Ian 
|tke 
(that; 



S EXPERIMENT 
DESCRIPTION OF x [aspects] OF 
[language] 1 
PROOF THAT s 
[fact] s IN [language] 1 
[area of knowledge] x/s_ 
EXPERIENCE [case ' histories, 
personal studies] OF x 
k [event] x/s_ 



III. Relevance 



©-»{.} 



i GENERALIZING FROM z \ 
REPEATING z 
REPLICATING z 
MAKING MORE COMPLETE z 
' MAKING MORE ACCURATE z 
MADE BY z 
^ PROPOSED BY z / 



Fig. 2: 



TYPICAL GUIDELINES FOR ABSTRACTORS 



Sources "LLBA". 



IMORMAUVt ABSTRACT 

An informative abstract presents (he conceptual content of an article Summarizing the essential ideas in an article, the 
abstract shcuik! amwor the following questions 

1. What is the thesis of the author* Whai hypothecs or theories are presented? 

2. How are the main hypotheses developed? What data aie used? What methods of isolating data are used? Was novel 
methodology employed ? Ate the data qualitatively and/or quantitatively manipulated? What tests, scale, indexes, or 
other summarizing techniques are used? 

,v What ate the proofs or evidence relevant to the hypotheses? 

•I What conclusions are drawn? Are the hypotheses/ideas, concepts, theories, etc., supported or rejected? What new 
relation ships are found, ofd ones reJffirnied or rejected? 

|he wtorniauyc abstract should show ihe meaningful, coherent relationship between the author's ideas and 
:* > g o 1 1 > e*r 1 1 >; f u 1 1 her in 4 >r e , it slum hi enable the researcher to see ihe difference between one arlale and others on the same 

: subject 

INDK-ATIVr ABSTRAC T 

Some at tides teg., bibbojjijphies, review jMicles, reports, and the hkc) cannot readily be somman/ed and require 
indicative abstracts which serve pmnari^y as descriptive guides. An indicative abstract tells briefly what an uriiele is about, 
what significant subjects it includes, and what its scope is. 
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Fig. 3: FABS FORMULARY AT APRIL 1W 



Section: I. Performative Verb 



[The author] 



({»'-}) 



'presents 

ASSERTS 
DENIES 
PROPOSES 
REPORTS 
INTERPRETS 
SUMMARIZES 
SPECULATES 
SUPPORTS 
<^ POSTULATES 
HYPOTHESIZES 
ANALYZES 
DISCUSSES 
APPROVES 
ADVOCATES 
QUERIES 
DOUBTS 



EXPLAINS 



IN DETAIL 
IN SUMMARY \ 
IN BRIEF 
IN PART 




AN ARGUMENT (FOR) 
ADEQUACY OF 
<J INACCURACY IN 
EXHAUSTIVENESS OF 
SOLE VALIDITY OF 




III. Means/Aims 



('(USING x 
CsStfr - <with 2) 
^EXEMPLIFIED BY x 




FOR THE PURPOSE 0T_s/t 
WITH A VIEW TO s/z 
^ TO ACCOUNT FOR s /x 
INFERRING (THAT) s/z 
CONCLUDING THAT 8 



0 
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II. Aspec t /Type 




TAXONOMY OF x 

EXPERIMENT TO a/QONCERNING x 
DESCRIPTION OF~x ^specta] (AS J.) 
PROOF THAT s/OF x 
[fact] s 
GRAMMAR (OF x) 
area of knowledge x 
(PERSONAL) EXPERIENCE (OF x) 
APPLICATION OF x (TO 
(fevent] a, x 
THEORY OF x 
INFLUENCE OF x ON £ 
CONFUSION \BY~*x) OF £ (WITH z) 
METHODOLOGY FOR s/x 
DATA (FROM x) OF £ 
EXPLANATION OF x (BY in terms of y) 
MODEL OF x (FOR jr) 
ANALYSIS OF x (INTO £) 
TEST OF x 
PROGRAM FOR s/x 



> 



J LANGUAGES IN GENERAL 
1 [language] 1^ 



V 



IV» Relev a nce. A ntecedents 



V 



GENERALIZING FROM x 
CONFIRMING X 
REPLICATING x 
CONTRARY TO x 
COMPLETING x 
EXTENDING x 

MAKING MORE ACCURATE x 
^ASED ON x 



\ 



/ 



/Tmade by i 
(< proposed by 
\ ^reported by £ 



4) 
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* : EXAMPLES OF FABS ABST RACTS 

Paillet, J.P. A Hofmann, T*R. , "Assumptions of Integrative Semantics 11 ! 
ln Integrative Se mantics I (SRG Monographs), 1972. 

PRESENTS IN BRIEF an ANALYSIS OF semantics INTO contents of messages 
and contents of lexical items and praxis phenomena 

&&«& PRESENTS a formal DEFINITION of a C-net AS an oriented graph of 
relationships between labelled nodes (semantic atoms) and unlabelled 
nodes standing for individuals to be eventually related to a referent 
in a universe, FOR THE PURPOSE OF representing the content structures 
of discourse 

&&«& POSTULATES the ANALYSIS OF lexical items INTO subnets of C-nets 

&&&& PROPOSES a DEFINITION OF understanding AS successful integra- 
tion of the current segment of a discourse into the C-net already 
built up. 

De Possel, R. 9 "Les Rgsultats obtenus depuis fin 1968 en reconnais- 
sance des formes et en particulier en lecture automatique par le 
R.A.M.I. 11 , T.A. Informatio ns, 1972, no. 2, p. 22-24. 

REPORTS IN BRIEF 1 RESULTS FROM an automatic reader OF roman charac- 
ters USING a contextual (digraph and trigraph) and semi-sequential 
model FOR PURPOSE OF reading print and computer-output microfilm. 

1 Full text to appear in Automat leme > 

Moinfar, D«, "Defini et non-d6fini en persan" , T. A. Informations , 
1972, no. 2, p. 20-21. " ' ~. 

PRESENTS MODEL OF recognition of definite and indefinite functional 
values IN Persian USING a table which relates marker and content. 

VaissiSre, J., "Contribution S la synthase par rlgles du fraugala", 
T.A. Information s, 1972, no. 2, p. 1-16. 

PRESENTS IN SUMMARY 1 a GRAMMAR OF & a PROGRAM FOR generation of 

prosodic features IN French BASED ON a phrase-structure syntax 

i ON number of syllables per phrase, FOR PURPOSE OF speech synthesis. 

1 of A. *8 Thesis 
2 

that of CETA, Grenoble. 



l.f. THESAURUS OF BUILDING SCIENCE AND TECHNOLOGY 
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ABSTRACT 

The I«F. Team have been struggling with the problems 
of information storage and retrieval in building! to 
the point that the basic tools had to be developed. 
These tools had to be explicit in their structure and 
construction, to compensate for the diversity of the 
building industry audience. The first of these tools 
is a hierarchical thesaurus, distinguished by its 
structure (a semi-lattice with nine levels) and its 
construction (based on a set of logical propositions)* 
Candidate terms are treated in pairs, using a set of 
questions to ascertain systematically what are the 
relationships between them. Data processing helped 
with the construction of the thesaurus, particularly 
in terms of methodology. There is a scope for further 
work, particularly in preparing the bilingual (English- 
French) version of the thesaurus, now in draft* 
(L'equipe I.F. a lutte avec les probl&mes de 1'enregis- 
trement et de la recherche de 1 1 information dans le 
domaine de la construction; fort decette experience, 
les membres de requipe se sont rendu compte qu'il 
fall ait developper les outils de base. Ces outi Is 
devraient avoir une structure et une construction clai- 
res f afin de compenser la diversity de r audience; les 
membres de 1 1 Industrie du batiment. Le premier de ces 
outils est un thesaurus hierarchique, sa structure (un 
semi-reseau S 9 niveaux) et sa construction (basfie sur 
une sSrie de propositions logiques) representent des 
innovations. Des termes candidats sont traites par 
paires, en se servant d'une serie de questions afin 
d 4 identifier systematiquement la nature des liens 
entre les termes. L' informatique a aide avec la cons- 
truction du thesaurus, surtout'en ce qui concerne la 
methodologie. II y a de grandes possibi li tes de con- 
tinuer 1e travail , surtout pour la preparation d'un 
thesaurus bill ngue (anglais- frangais) actuellement a 
V 1 etude). 
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